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^ . Abstract 

In this paper, we study the design and analysis of optimal detection scheme for sensors that are 
J> , deployed to monitor the change in the environment and are powered by the energy harvested from 

. the environment. In this type of applications, detection delay is of paramount importance. We model 
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this problem as quickest change detection problem with a stochastic energy constraint. In particular, 



' a wireless sensor powered by renewable energy takes observations from a random sequence, whose 

distribution will change at a certain unknown time. Such a change implies events of interest. The energy 



in the sensor is consumed by taking observations and is replenished randomly. The sensor cannot take 
observations if there is no energy left in the battery. Our goal is to design a power allocation scheme 
and a detection strategy to minimize the worst case detection delay, which is the difference between the 
time when an alarm is raised and the time when the change occurs. Two types of average run length 
(ARL) constraint, namely an algorithm level ARL constraint and an system level ARL constraint, are 
considered. We propose a low complexity scheme in which the energy allocation rule is to spend energy 
to take observations as long as the battery is not empty and the detection scheme is the Cumulative Sum 
test. We show that this scheme is optimal for the formulation with the algorithm level ARL constraint 
and is asymptotically optimal for the formulations with the system level ARL constraint. 
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cumulative sum test; energy harvesting sensor; non-Bayesian quickest detection; sequential detec- 
tion. 

I. Introduction 

Recently, the study of sensor networks powered by renewable energy harvested from the 
environment has attracted considerable attention [DQ-[|5]. Compared with the sensor networks 
powered by batteries, the sensor networks powered by renewable energy have several unique 
features such as unlimited life span and high dependence on the environment etc. Optimal 
power management schemes for each individual sensor and scheduling protocols for the whole 
network have been developed to maximize utility functions of communication related metrics 
such as channel capacity, transmission delay or network throughput. However, besides these 
communication related metrics, there are other signal processing related performance metrics 
that are also important for sensor networks targeted for certain applications. For example, if a 
sensor network is deployed to monitor the health of a bridge, then the detection delay between 
the time when a structural problem occurs and the time when an alarm is raised is of interest. 
As another example, if a sensor network is deployed for intruder detection, then the detection 
delay and the false alarm probability are of interest. 

Until now, these alternative but important performance metrics have not been investigated 
for sensors powered renewable energy. In this paper, we focus on the design of optimal power 
management schemes for such wireless sensor networks when the detection delay is of interest. In 
particular, we focus on so called "quickest detection" problem. In the quickest detection problem, 
wireless sensors are deployed to quickly detect the change (these terms will be precisely defined 
in the sequel) in the environment. Such changes typically imply certain activities of interest. For 
example, in the bridge monitoring, a change may imply that a certain structural problem has 
occurred in the bridge. As the result, it is of paramount importance to minimize the detection 
delay after the presence of such a change, hence the name of quickest detection. Besides this 
application, quickest detection also has many other potential applications, such as the quality 
control BU, network intrusion detection [7], cognitive radio jH, etc. We note that the detection 
delay in the change point detection problem refers to the delay between the time when a change 
occurs and the time when an alarm is raised. It is not the delay from time zero to the time when 
an alarm is raised, since we are interested in the change. 
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Non-Bayesian quickest detection is one of the most important formulations, which was first 
studied by G. Lordon [9| and M. Pollak ifTOl . Under the non-Bayesian setup, a sensor sequentially 
observes a random sequence {X k ,k = 1,2,...} with a fixed but unknown change point t. 
Before the change point t, the sequence X 1 , . . . , X t ^i are independent and identically distributed 
(i.i.d.) with probability density function (pdf) f , and after t, the sequence are i.i.d. with pdf f±. 
Under an average run length (ARL) to false alarm constraint, namely the expected duration to 
a false alarm is at least 7, Lorden's setup is to minimize the "worst-worst case" detection delay 
sup t esssup E 4 [(T — t + + . . . , where T is the stopping time at which an alarm is 

raised, while Pollak's setup is to minimize the "worst case" conditional average detection delay 
sup t E t [(T — t)\T > t). Since no prior information about the change point is required, these 
non-Bayesian setups are very attractive for practical applications. 

In the above mentioned classic setups, there is no energy constraint and the sensor can take 
observations at every time slot. In this paper, we extend Lorden's and Pollak's problems to 
sensors that are powered by renewable energy. In this case, the energy stored in the sensor is 
replenished by a random process and consumed by taking observations. The sensor cannot take 
observations if there is no energy left. Hence, the sensor cannot take observations at every time 
instant anymore. The sensor needs to plan its use of power carefully. Moreover, the stochastic 
nature of the energy replenishing process will certainly affect the performance of change detection 
schemes. Since the energy collected by the harvester in each time instant is not a constant but 
a random variable, this brings new optimization challenges. 

We first consider the scenario in which a unit of energy arrives with probability p at each 
time instant. For Lorden's setup, two types of ARL constraint are considered in this paper. 
The first type is an algorithm level ARL constraint, which puts a lower bound on the expected 
number of observations taken by the sensor before it runs a false alarm. The algorithm level 
ARL constraint is independent of the energy arriving probability p. Under this setup, we prove 
that the optimal detection procedure is the well known cumulative sum (CUSUM) procedure 
proposed in [9|, and the optimal power allocation scheme is to allocate the energy as soon as it 
is harvested. The second type ARL constraint is on a system level, which puts a lower bound on 
the expected duration to a false alarm. This constraint is related to the energy arriving probability 
p. In this case, we show that CUSUM procedure and the immediate power allocation strategy 
is asymptotically optimal when the system ARL goes to infinity. For Pollak's setup, we discuss 
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the problem only with the system level ARL in detail. As we can see later, the immediate power 
allocation coupled with CUSUM detection is actually asymptotically optimal for both the system 
level ARL and the algorithm level ARL. 

We then consider a more general energy arriving process in which more than one unit of 
energy can arrive at each time instant. In this scenario, we show that a simple energy allocation 
policy, in which the sensor takes samples as long as there is energy left at the battery, coupled 
with CUSUM test is asymptotically optimal for both Lorden and Pollak's setups when the system 
level ARL goes to infinity. 

There have been some existing works on the quickest change point detection problem that take 
the sample cost into consideration. The first main line of existing work considers the problem 
under a Bayesian setup. The main difference between the Bayesian setup and non-Bayesian setup 
is that in the Bayesian setup, the change point is modeled as a random variable with a known 
distribution. No such assumption is made in the non-Bayesian setup. considers the design of 
detection strategy that strikes a balance between the detection delay, false alarm probability and 
the number of sensors being active. In particular, [7] considers a wireless network with multiple 
sensors monitoring the Bayesian change in the environment. Based on the observations from 
sensors at each time slot, the fusion center decides how many sensors should be active in the 
next time slot to save energy. iTTTTl take the average number of observations into consideration, 
and provides the optimal solution along with low-complexity but asymptotically optimal rules. 
In lfT2ll . the authors propose a DE-CUSUM scheme for the non-Bayesian setup and show that it 
is asymptotically optimal. 

The remainder of this paper is organized as follows. The mathematical model is given in 
Section HB Section [III] presents the optimal solution for Lorden's problem under the algorithm 
level ARL constraint and the performance analysis for the optimal solution. In Section [IV] we 
present asymptotically optimal solutions for Lorden's and Pollak's problems under the system 
level ARL constraint. Section [V] presents our results for a more general energy arriving model. 
Numerical examples are given in Section |Vl]to illustrate the results obtained in this work. Finally, 
Section IVIII offers concluding remarks. 
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II. Problem Formulation 

Let {X k ,k = 1,2,. . .} be a sequence of random variables whose distribution changes at a 
fixed but unknown time t. Before t, the {X^j's are i.i.d. with pdf / ; after t, they are i.i.d. with 
pdf fx. The pre-change pdf f and post-change pdf fx are perfectly known by the sensor. We 
use P t and E t to denote the probability measure and the expectation with the change happening 
at t, respectively, and use P^ and Eqo to denote the case t = oo. 

We assume that the energy arrives randomly at each time slot. To facilitate the presentation 
and set up notation, we present the model for the case when the energy arriving process is a 
Bernoulli process with parameter p in this section. A more general model will be considered in 
Section |V] Specifically, we use v = {ux, v 2 , . . . ,Vk, • • •} to denote the energy arriving process 
with z/fc G {0, 1}, in which v k — 1 indicates that a unit of energy is collected by the energy 
harvester at time slot k and v k = means that no energy is harvested. {u k } is i.i.d. over k. 
Moreover, we use P u to denote its probability measure (correspondingly, we use W to denote 
the expectation according to the measure P v ), and we have P v {y k = 1) = V- 

The sensor can decide how to allocate these collected energies. Let \l = {/ii, /x 2 , • • . , • • • } 
be the power allocation strategy, where fj, k ^ {0? 1}> m which fi k = 1 means that the wireless 
sensor spends a unit of energy on taking an observation at time slot k, while fi k = means that 
no energy is spent at time k and hence no observation is taken. 

The sensor's battery has a finite capacity C . The energy arriving process and the energy 
utilizing process will affect the amount of energy left in the battery. We use E k to denote the 
energy left in the battery at the end of time slot k. E k evolves according to: 

E k = minfC, E k _x + v k - /j, k ]. 

The energy allocation policy ji must obey the causality constraint, namely the energy cannot 
be used before it is harvested. The energy causality constraint can be written as 

E k >0 A; = 1,2, .... (1) 

We use hi to denote the set of all /i's that satisfy ([I]). 

The sensor spends energy to take observation. The observation sequence is denoted as 
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{Z k , k = 1, 2, . . .}, where 

X k if u fc = 1 

^ . (2) 

(j) if yUfc = 

We call an observation Z k a non-trivial observation if ji k = 1, i.e., if the observation is taken 
from the environment. 

{Z k ys are not necessarily conditionally (conditioned on the change point) i.i.d. due to the 
existence of {fi k }. The distribution of Z k is related to both fx k and X k . Therefore, we use P£ 
and E^ 4 to denote the probability measure and expectation of the observation sequence {Z k } 
with the change happening at t, respectively. 

In this paper, we want to find a stopping time T, at which the sensor will declare that a change 
has occurred, and a power allocation rule /i that jointly minimize the detection delay. Clearly, 
the power allocation strategy fi k depends causally on the observation process, the energy arriving 
process and the energy utilization process: 

fJLk = 9k(Zi~ 1 ,Vi,f4~ 1 ), 

in which Z^ 1 denotes the vector [Z x , . . . , Z k _i\, v\ and /zj -1 are defined similarly, and g k is 
the power allocation function used at time slot k. 

We consider three problem setups. The first one is Lorden's quickest detection problem with 
an algorithm level ARL constraint, which is formulated as 

(PI) min d(u,T), 

s.tE OB [N]>ri, (3) 

where T is the set of all stopping time with [T] < oo, N is the total number of non-trivial 
observations taken by the sensor before it claims that the change has happened and 

d(/i,T) = sup d t (/i, T), 
t>i 

d t (fi,T) = esssup El 1 [(T - t + 1)+| F t -i] , (4) 

where T k is the set of all observations till time k, namely T k — {Z l7 ■ ■ ■ , Z k }. In this case, we 
put a lower bound 77 on the average number of observations taken before a false alarm is raised. 
The larger 77 is, the less frequent a false alarm will be raised. Since this constraint is independent 
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of the power allocation scheme fi and energy arriving sequence v, this problem setup is robust 
against the variation of the ambient environment. 

The second problem considered in this paper is Lorden's quickest detection problem with a 
system level ARL constraint, which is formulated as 

(P2) min d(u,T), 

s.t. E^T] > 7. (5) 

In this formulation, a lower bound is set on the expected duration to a false alarm. In contrast 
to the previous case, this constraint depends on the power allocation /i, which is further related 
to the energy arriving probability p. Therefore, this setup is more sensitive to the environment. 

In some applications, Pollak's formulation is of interest since its delay metric is less con- 
servative than that of Lorden's formulation. In our context, Pollak's formulation can be written 
as 

(P3) min sup Ef [T — t\T > t] , 

MGW.rer t >x 1 - J 

s.t.E^[T]> 7 - (6) 

Even without the additional energy casuality constraint, the optimal solution for Pollak's for- 
mulation is still unknown. Therefore, in this paper, we discuss only the asymptotic solution for 
Pollak's formulation. In the sequel, we will see that the proposed asymptotically optimal solution 
under the system level ARL constraint is also asymptotically optimal under the algorithm level 
ARL constraint. Hence, in the paper, we discuss only the system level ARL constraint for Pollak's 
formulation in detail. 

For an arbitrary realization of the power allocation scheme fi, we will use the following 
notation throughout of the paper: 

1) {dfc,A; = l,2,...}to denote the time instants at which the energy harvester harvests a unit 
of energy, i.e., u ak = 1; 

2) {b k , k — 1, 2, . . .} to denote the time instants at which the sensor takes observations, i.e., 

/% = i; 

3) fxj^ k ' bk \ k — 1, 2, . . .j or k — 1, 2, . . .j to denote the non-trivial observation se- 
quence, which is the subsequence of {Z k ,k = 1,2,...} with all its non-trivial elements. 
In particular, xj^ h ' bk ^ will be used when we want to emphasize the sampling time. Here 



X {a k ,b k ) ^ kth 

non-trivial observation taken by the sensor at time b k using the energy 
arriving at time a k . 

Using above notation, the energy causality constraint indicates the following inequality: 

h>a k , k = l,2, .... (7) 

An example of the realization of the sensor sampling procedure (and corresponding notation) 
is shown in Figure [Q 
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Fig. 1: An example of the realization of the sampling procedure 



III. Optimal solution forLorden's formulation with the algorithm level 

ARL CONSTRAINT 

In this section, we study the optimal solution for (PI). We use L(-) to denote the likelihood 
ratio (LR), and use /(•) = logL(-) to denote the log likelihood ratio (LLR). For the observation 
sequence {Z k }, LR is defined as 



L(Z k ) 



fl ( ) * -P 1 

1, if n k = 



(8) 



The CUSUM statistic and Page's stopping time can be written as O 



Sk = max 

l>q>k 



i=q 



max[5 ft _i, l}L(Z k 



and 



T p = M{k > 0\S k > B], 



respectively. 

Generally, for a given detection strategy pair (/x, T), the detection delay d t (fx, T) in © varies 
from different change point t. If there is an equalizer strategy which makes d 4 (/x, T) be a constant 
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over t, it might be a good candidate for the optimal strategy for the minmax problem. Similar to 
the conclusion that Page's stopping time is an equalizer rule for the classical Lorden's problem 
|fT3l , we have following proposition: 

Proposition 3.1: The power allocation scheme fi* = v and Page's stopping time T p together 
achieve an equalizer rule, i.e., d t (fi*,T p ) = d\{jJL*, T p ), Vt > 1. 

Proof: Since //* = v indicates that {//£}'s are i.i.d. over k, {-Zfcj's are conditionally i.i.d. 
given the change point t. 

Notice that Wk = max.[Sk, 1] is a non-decreasing function of Sk, and on the event {T p > t}, 
T p is a non-increasing function of W t -i. Then we have 



Since {Wk} is a homogeneous Markov chain under the power allocation scheme ji* k = then, 



Remark 3.2: jj* = v indicates n* k = v k for every k, that is, the sensor spends the energy 
taking observation immediately when it obtains an energy from the environment. Therefore, we 
call /i* the immediate power allocation scheme in the sequel. 

The next lemma shows that the immediate power allocation scheme along with the CUSUM 
detection scheme is optimal for (PI). 

Lemma 3.3: The optimal power allocation strategy for (PI) is //*, and the optimal stopping 
time is T p with the threshold B being a constant such that EoofiV] = rj. 

Proof: The proof consists of two steps. The first step is to show that for an arbitrary but 
given power allocation /x, T p is the optimal stopping time. The second step is to show that under 
T p , /i* is the optimal power allocation scheme. A detailed proof is provided in Appendix [A] ■ 

In the following, we analyze the performance of (fi*,T p ) by determining the detection delay 
and the algorithm level ARL. Since {Zk} is a conditionally i.i.d. sequence under p*, we can 
apply Wald's lemma [f3| in our analysis. We have the following proposition: 

Proposition 3.4: Suppose B > 1, then 



esssup [T p — t + 1 1 J"t-i] 
Ef [T p -t+l|W t _ 1 = l]. 



(9) 



dt(fi*,T p ) =d 1 (fi*,T p ). 



Eoo[iV] 



EqoM 



(10) 



d(fi*,T p ) 



1-PooW 

1 Ei[«] 

pi-Pi{F y 



(ii) 
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where k is the stopping time 

K = min ( m > 1 
and F denotes the event 

( m 

v < 



m 

5^(l fc ) £(0,logi?) 
fc=i 



fc=i 

Proof: The proof follows closely that of Theorem 6.2 in ffT3l . A detailed proof is given in 
Appendix El ■ 

We note that in Proposition 13 .41 ARL and d(/j*,T£) are given as functions of Poo(-fo) and 
Pi(F ), whose precise values are difficult to evaluate. The following result, which is an extension 
of Lorden's asymptotical result [9], shows d(/j,*,T£) scales linear with log?] when 77—7-00. 

Proposition 3.5: As rj — > 00, we have 

c?(/i ,T P ) ~ — , (12) 

p i 

in which / = I(fi, /o) is the KL divergence of fi and / . 

Proof: This statement can be shown by discussing the relationship between one-sided 
sequential probability ratio test (SPRT) and CUSUM. The discussion is similar to the proof 
of Lemma 14.21 therefore, we omit the proof for brevity. ■ 

IV. Asymptotically optimal solution under the system level ARL constraint 

In this section, we consider (P2) and (P3). Since both the detection delay and the system 
level ARL constraint are related to the power allocation fi, it is generally difficult to solve these 
coupled problems. Inspired by the previous section, we propose to use the simple detection 
strategy (fi*,T p ). We will show that this simple strategy is asymptotically optimal for (P2) and 
(P3) as 7 — > 00. 

The asymptotic optimality of (/i*,T p ) in the rare false alarm region (7 — > 00) can be shown 
by two steps. In the first step, we derive a lower bound on the detection delay for any power 
allocation and detection scheme. In the second step, we show that (fi*,T p ) achieves this lower 
bound, which then implies that (fi*,T p ) is asymptotically optimal. 

The following lemma presents our lower bound on the detection delay. 



1 1 



Lemma 4.1: As 7 — > 00, 

inf M/i, T) : E^[T] > 7} 
> inf(supE?[T-t|T>*]:E^[T]>7 
1 I log7 



> -- 

P 



[1 + 0(1)). 



(13) 



Proof: Please see Appendix ■ 

This lower bound | log7|(p/)~ 1 (l +o(l)) can be obtained by (p*,T p ) for both (P2) and (P3). 

More specifically, we have 

Lemma 4.2: (/x*,T p ) is asymptotically optimal for (P2) as 7 — > 00. Specifically, 

>/ * ^ 1 I log 7 1 
p 1 

Proof: Please see Appendix [D] 



(14) 



Lemma 4.3: (/x*,T p ) is asymptotically optimal for (P3) as 7 

sup if [T p - t\T p > t] ~ 
Proof: Please see Appendix El 



00. Specifically, 



(15) 



As we mentioned in Section HH although we consider Pollak's formulation only under the 
system level ARL constraint in detail in this paper, the proposed strategy (fi*,T p ) is also 
asymptotically optimal for the formulation under the algorithm level ARL constraint, which 
is stated in the following proposition: 

Proposition 4.4: (fx*,T p ) is asymptotically optimal for Pollak's formulation under the algo- 
rithm level ARL constraint as 77 — > 00, and we have 

1 I log 77 1 
P 



supEf [T p -t\T p > t] 



t>i p I 

Proof: Following the similar argument used in Proposition 13.41 we have 

" N 
5> 



(16) 



e£[t p ] = e£m=e£ 



1=1 



-EoofiV]. 
P 



That is, under the immediate power allocation //*, the algorithm level ARL constraint EoofiV] > rj 
can be equivalently converted into a system level ARL constraint E^[T P ]. Setting 7 = rj/p for 
a given p, r] — > 00 is equivalent to 7 — > 00. By Lemma H31 (yU*,T p ) is asymptotically optimal 
under the system level ARL constraint, hence it is asymptotically optimal under the algorithm 
level ARL constraint. ■ 
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V. Extension 

In this section, we extend the original problem setup by assuming that the energy harvester 
can receive more than one unit energy at each time slot. Specifically, we assume that the energy 
arriving sequence v = {vi, . . . , u k , . . .} is i.i.d. over k. v k 6 V = {0, 1, 2, . . .}, in which {v k = 0} 
means that the energy harvester collects nothing at time slot k and {u k = i} means that the 
energy harvester collects i units of energy at time k. We use pi = P v {y k = i) to denote its 
probability mass function (pmf). Then the energy left in the battery at the end of time slot k is 
updated by 

E k = min{C, E k _ x + u k - /i k }, 

and the energy causality constraint indicates E k > 0. 

Under this setup, we consider (P2) and (P3). We propose to use a generalized immediate 
power allocation strategy: 

1 if E k „ x + u k >\ 
if + u k = 
That is, the sensor keeps taking observations as long as the battery is not empty. 

In the following, we show that this generalized immediate power allocation ji* combined 
with Page's stopping time T p is asymptotically optimal for (P2) and (P3) in this random energy 
arriving case. Corresponding to Lemma |4~T1 Lemma |4~21 and Lemma H31 we have following two 
lemmas: 

Lemma 5.1: As 7 — > 00, 

mfM/i,T):E^[T]>7} 

> mf| S u P Enr-t|r>t]:E^[r]> T | 

> -J-^(l + o(l)), (17) 
p 1 

where p = E U [ji*} . 

Proof: Please see Appendix |FJ ■ 
Lemma 5.2: (fl*,T p ) is asymptotically optimal for (P2) and (P3) as 7 — > 00. Specifically, 

d(^T p )~ii^, (18) 
p 1 
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and 

supEf [T p -t\T p >t}~ -i-^fA (19) 
t>i p i 

Proof: Please see Appendix [Gl ■ 

VI. Numerical Simulation 

In this section, we give some numerical examples to illustrate the analytical results obtained 
in this paper. In these numerical examples, we assume that the pre-change distribution f is zero 
mean Gaussian with variance a 2 and the post-change distribution fi is zero mean Gaussian with 
variance P + a 2 . In this case, the KL divergence is /(/i,/o) = \ 
signal-to-noise ratio is defined as SNR = 101ogP/cr 2 . 

In the first example, we illustrate the equalizer property of (/i*, T p ) under Lorden's formulation. 
The equalizer property plays a critical role in the performance analysis, since it allows us to 
study d(n*,T p ) through a relatively simple expression [T p ]. In this example, we compare 
our optimal strategy with a seemingly reasonable strategy: a save-test power allocation scheme 
combined with CUSUM. The save-test power allocation is a two-threshold strategy: 1) The 
sensor saves the collected energy for future use if the energy stored in the sensor is less than a 
threshold c\ and the CUSUM statistic is less than threshold c 2 ; and 2) the sensor takes observation 
when either of these two thresholds is exceeded. This rule says that if the CUSUM statistic is 
low (suggesting that a change has not happened yet) and the energy stored in the sensor is 
low, the sensor saves its energy. On the other hand, if either the sensor has enough energy, or 
the CUSUM statistic is high, the sensor should take an observation. In this simulation, we set 
a 2 = 1, SNR = OdB, p = 0.5 and 7 = 560. The simulation result is shown in Figure |2] In the 
figure, the blue line with circles is the performance of (/i*,T p ), the green dash line with stars is 
the performance of the save-test power allocation with CUSUM. This simulation confirms our 
analysis that (/i*,T p ) is an equalizer rule, i.e., g?i(/i* ,T p ) = d t ([i*,T p ). However, the save-test 
power allocation scheme along with CUSUM is not an equalizer rule. Actually, in the save-test 
power allocation scheme, di(/j,T) is larger than others. This is due to the fact that in the first 
time slot, both the CUSUM statistic and the energy stored in the sensor are zero, hence the 
sensor chooses to store its energy. The sensor will not take observations until the stored energy 
exceeds c 2 . The duration of this energy collection period is independent of the change point. 



lo glTPA7 
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Then, the worst case happens at t — 1, and the detection delay caused by the energy collection 
period is larger than that caused by the immediate power allocation. Since Lorden's performance 
metric focuses on the worst case, the save-test power allocation is not as good as the immediate 
power allocation. 



— e — immediate power allocation scheme 
* - save-test power allocation scheme 



40 60 
t 



Fig. 2: The change point t vs d t (T p 



In the second example, we illustrate the relationship between the detection delay and the 
expected number of observations to false alarm with respect to the energy arriving probability 
p under setup (PI). In this simulation, we set a 2 = 1, SNR = OdB. The simulation result is 
shown in Figure |3] In this figure, the blue line with circles is the simulation result for p = 0.2, 
the green line with stars and the red line with squares are the results for p = 0.5 and p = 0.8, 
respectively. The black dash line is the performance of the classical Lorden's problem, which 
serves as a lower bound since in this case the sensor can take observations at every time slot. 
As we can see, for a given t], the detection delay is in inverse proportion to the energy arriving 
probability p. The larger p is, the closer is the performance to the lower bound. 

In the third scenario, we examine the asymptotic optimality of (yU*, T p ) for (P2) and (P3). In this 
simulation, we set p = 0.3, a 2 = 1 and SNR = 5dB. In this case, we have I(f±, fo) = 0.8681. 
The simulation result is shown in Figure |4] In this figure, the blue line with circles is the 
performance of (P2). The red line with squares is the performance of (P3), and the black dash 
is calculated by | log7|/p/. Along all the scales, the red curve is below the blue one, which 
indicates that Pollak's detection delay is smaller than Lorden's detection delay. We also notice 
that these three curves are parallel to each other, which confirms that the proposed strategy, 
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Fig. 4: Detection delay v.s. the system level ARL 



(fi*,T p ), is asymptotically optimal since the difference between them is negligible as 7 — >■ 00. 

In the last scenario, we examine the asymptotic optimality of (jl*, T p ) for (P2) and (P3) in the 
extension case that the energy arrives randomly both in amount and in time. In the simulation, 
we use C = 3, and we assume that the amount of energy arrives at each time slot takes values 
in the set V = {0, 1, ... , 4}. In this case, the probability transition matrix is given as 

P0+P1, P2, P3, Pi 

p= PO, PU P2, P3+P4 

0, PO, Pi, Ei= 2 Pi 

0, 0, po, Ei=lPi . 
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In the simulation, we set p = 0.8, pi = 0.1, p 2 = 0.05, p 3 = 0.025, p 2 = 0.025, then the 
stationary distribution is w = [0.0182, 0.0545, 0.2000, 0.7273] T and p = 1 - p w = 0.9964. 





Performance of (P2) 






-■- Performance of (P3) 
|log y|/E(|i*)I 
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Fig. 5: Detection delay v.s. the system level ARL 

In this simulation, we set a 2 = 1 and SNR = 5dB. The simulation result is shown in Figure 
[51 In this figure the blue line with circles is the performance of (P2). The red line with squares 
is the performance of (P3), and the black dash is calculated by | log 7 1 /pi" . Similar to the results 
obtained in the third simulation scenario, along all the scales, Pollak's detection delay is smaller 
than Lorden's detection delay, and these three curves are parallel to each other, which confirms 
that the proposed strategy, (jl*,T p ), is asymptotically optimal as 7 — >• 00. 

VII. Conclusion 

In this paper, we have studied the non-Bayesian quickest detection problem using a sensor 
powered by the energy harvested from the environment. Since the energy harvester collected the 
energy randomly, the quickest detection problem is subjected to a casual energy constraint. Three 
non-Bayesian quickest detection problem setups, namely Lorden's problem under the algorithm 
level ARL, Lorden's problem under the system level ARL and Pollak's problem under the system 
level ARL, have been considered. For the binary energy arriving model, we have shown that 
the immediate power allocation scheme coupled with CUSUM detection procedure is optimal 
for the first setup, and is asymptotically optimal for the second and the third setup as ARL 
goes to infinity. For the more general energy arriving model, we have shown that the proposed 
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generalized immediate power allocation coupled with CUSUM is still asymptotically optimal 
for the second and third setups. 

Appendix A 
Proof of Lemma [33] 

We first introduce a notion of quasi change point. For any realization of the power allocation 
fx, the quasi change point of the non-trivial observation sequence is defined as 

n = mf{k : X k ~ f x } = inf{k : b k > t}. (21) 

This implies that n can be viewed as the change point happening in the non-trivial observation 
sequence |x^° fe ' 6t: H. Therefore, a rule minimizing the detection delay (T — t) + among {Z k } is 
the same as the one minimizing (N — n) + among |x^ afe,6 ^|. Specifically, the stopping rule is 
decided by 

min sup esssup E n [(N — n + l) 4 "!^.!] , 

N n>l 
S.t. Eqo [N] > Tf. 

This is the classical Lorden's quickest detection problem [HI, and the optimal solution is given 
as Page's stopping time T p in [|T4|] with threshold B, which is a constant solely related to rj and 
achieves EqoITV] = 77. 

To prove the optimality of fx*, we examine the following problem: 

s.t. EoofiV] = rf. (22) 

Notice that the objective function is the same as di(fx,T p ). Since 

Et[T p ] = E$[b N ] > E»[a N ] ( = } Ef [T p ], 

in which inequality (a) is due to ©, and equality (b) is true because T p = o,n under fx* = v. 
Therefore, fx* is optimal for the problem (1221 . 
Since 

mindi(/i,T) = d^fx* ,T P ) = d t (fx*,T p ), 
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in which the last equality is due to Proposition 13 . 1 L we have 

d(/j,*,T p ) = di(jf,Tp). 

Combining this with the fact that 

d(fi,T) > di(ji,T), 
we know that (fi*,T p ) is the optimal solution for (PI). 

Appendix B 
Proof of Proposition \3A] 

We first examine the quantity Eoo [N] . Consider the non-trivial observation sequence \ x[ ai 



:th 



let Mj denote the indicator of the event that the j repetition of k exits at the upper boundary. 
That is Mj = 1 if the j th repetition exits at the upper boundary, and Mj = if the j th 
repetition exits at the lower boundary. Let J be a stopping time with respect to the sequence 
(ki, Mi), (« 2 , M 2 ), . . ., which is i.i.d. under such that J = inf{j : Mj = 1}. One can check 

that N = J2j=i K r 

From Wald's identity, we have 

" j 

E 



K 3 



IooMEcM. (23) 



Eoo[N] =E 

It is easy to see that, under P^, J is a geometric random variable with 

p oc ( j = j) = [i - pm)} IPMW" 1 , j = i,2,... 

Then, we have 



Substituting d24J) into d23j), we have (ITOb . 

Following the similar argument as above, we get 



Ei [TV] 



l-Pi(Fo) - 

Denote T{ = di — aj_i as the time interval between two successive observations, the p.m.f. of T{ 
is 

p( Ti = j) = (i- P y- 1 p, 
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and the average of the time interval between two successive observations is 

1 



E" 



For the average detection delay, we have 

d(ji*,T p ) 



P 

Ef[T p ] 
Ef[a N ] 

N 



Ef 



E 

i=i 



Ti 



(a) 



E v [r] Ei [N] 

= -E^N}. 
P 

Here, (a) is due to the Wald's identity. Then (fTT|) follows. 

Appendix C 
Proof of Lemma ED 

This proof relies on several supporting propositions and Theorem 1 of [Q2 
Proposition C.l: For an arbitrary but given power allocation fx, we have 



f 1 t+q 
lim esssupP/ 1 < — max N HZA > (1 + e)I\ 

m->oo m 0<q<m 

where Ji = pi. 

Proof: We first show that the inequality 



Z t _i ^ > 0, 



(25) 



t+m-l 



— l(Zi) < Ii, as m — y oo, 



(26) 



holds almost surely under Pf for any t > 1. 

To show this, we first consider the immediate power allocation /x*, by the strong law of large 
numbers, we have 



t+m-l 



as m — y oo, 



1 V ^ a.s. 

— > Hi~>P, 

m z — ' 

i=t 
1 ra+m— 1 

- E i(^)^/(/x,/o), 



as m oo, 
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in which n is the quasi change point defined in (12TT) . Therefore, under fx*, as m — > oo, we have 

t-\-m— 1 A ., n+m— 1 

E z(Xi) a 4>/ = /i, (27) 



t+m— 1 

1 E 

i=t 



rh 1 
m m 



where rh is the number of nonzero elements in {/x£, . . . , /4+m-i}- 

For an arbitrary power allocation fx with lim sup^^ = 1, we always have rh < rh + C 
because of the causal energy constraint, where rh denotes the number of nonzero elements in 
{(A t , fx t+m -i}- Therefore, as m ->> oo, 



t+vn-l 



m 



E KZi 



rh 1 



n+fh—l 



i=t 



m m 



< 



m + Cl 



n+rh— 1 



E ' (*) 4 ^- 



For the power allocation scheme /i with lim sup^,^ = 0, we have 

t+m—l 



lim — V Z(Zi) = < pi. 

n.—^nn m < * 



i=t 



Therefore, inequality (1261 ) holds for any arbitrary fx. Notice that i) (1261) holds in the almost sure 
sense, since (|27l) converges in the almost sure sense; and ii) (1261) holds for any realization of 

Z\, . . . , Zt-i. 

For any e > 0, define 



T e * = sup < m > 1 



t+m-l 



m z — ' 



Due to (|26l) , we have 



which indicates 



essinf Pf{T% < oo|Zi, . . . , Z t _ x } = 1, 



f 1 * +<? 
lim esssupP/ 4 < — max > Z(Zj 

m^oo I ??7, 0<q<m ^— ' 



> (1 + eUi 



Note that Proposition IC. 1 1 holds for every t > 1, therefore 



f 1 <+? 
lim sup esssup P/ 1 < — max > Z(Zj 

m-5>oo t>1 777, 0<g<m'^ : — 1 ' 

— i=t 



> (l + s)h 



Z l ,...,Z t _ 1 }^0. 



(28) 
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To prove Lemma |4~T1 we need Theorem 1 in [fT51 , which is restated as follows: 
Theorem C.2: ( KT5\l ) Let {Z k } be a random variables sequence with a deterministic but 
unknown change point t. Under probability measure P t , the conditional distribution of Z k is 

/o(-|Zf -1 ) for k < t and is / x (- 1 Z^" 1 ) for k > t. Denote l(Z k ) as 

l{Z k ) = log 

If the condition 

lim supesssup P t I max > l{Zi) > ii(l + e)m Zi, . . . , Z t _i > — > 0, Ve > (29) 

m— s-oo I 0<q<m ' J 

K i=t ) 

holds for some constant p. Then, as 7 — >■ 00, 

infH/i,T):E 00 [T]> 7 } 

> inf |supE t [T-t|T > t] : E^T] > 7 J 

> (/r 1 + o(i))iog 7 . 

Proof: Please refer to lfl5l . ■ 
In our case, for any arbitrary but given power allocation /i, the conditional density 

/o^lZ*- 1 ) = / (X fe )P ({^ = 1} IZ^ 1 ) + <5(0)P ({^ fe = 0} IZ^ 1 ) , 

where 5(0) is the Dirac delta function. Similarly, we have 

A" (z^z*- 1 ) = A(x fc )p ({^ = 1} iz*- 1 ) + 5(<p)p ({/i fe = 0} iz^ 1 ) . 

Therefore, the log likelihood ratio in Theorem IC.2I 



l(Z k ) = log 



/f(^izn j bggi}, if ma, = 1 



^(Z fc |Z^) 0, ifAi* = 

which is consistent with the definition in ([8]). Moreover, (1281) indicates that, for any arbitrary 
power allocation, (l29l holds for the constant ii = pL Therefore, the conclusion in Theorem IC.2I 
indicates the result for our case: 

inf{d( At ,T):E^[T]> 7 } 

> inf|supEnT-t|T>t]:E^[T]>7} 

> (/r 1 + o(l))log 7 . 
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Appendix D 
Proof of Lemma 03] 

First, a result similar to Proposition 13 . 1 1 still holds in this case. Specifically, 

Proposition D.l: (//*, T p ) is an equalizer rule for (P2), i.e., we have d t (n*, T p ) = T p ),Wt > 

1. 

Proof: This proof is similar to that of Proposition 13.11 Hence, we omit the proof for brevity. 

■ 

The rest of the proof can be shown by discussing the relationship between CUSUM and one 
sided SPRT. Denote SPRT statistic as 

k 

Ai :k = Y[L{Zi), (30) 

i=l 

and the stopping time as 

T Sil = inf{£;> l|A 1:fe >5}. 



Since the CUSUM statistic 



Sh = max 

l>q>k 



i=q 



> L(Zi) = A 1: jt, 



i=i 



we always have 

Ef [T p ] <Ef [T S)1 ] 



By the performance of SPRT (Proposition 4.11 in ffT3l ). we have 

Noting that d(n*,T p ) = di(n*,T p ) = Ef[T p ] and using Lemma gj] we have 



7/ * m \ 1 I log 7 1 
d{fi ,l p 



-pi ~ 



Moreover, by (10) in Theorem 2 of [|9), the threshold B = 7 will guarantee 
The proof is complete. 
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Appendix E 
Proof of Lemma 03] 

We again consider the one sided SPRT with the threshold B = 7, which will guarantee 

E£(T P )>7. 

Let T s >t denote the stopping time of SPRT starting at time instant t, i.e., 



T s t = inf < m > 1 



t+m-l 

n w > b 

then Page's stopping time can be written as 

T p = mf{T s , t + t-l\t= 1,2,...}. (31) 

Note that 

{T p <t} = {T Sjl < U . . . U {T s>4 _! < 1} G JU, 

therefore, 

{T p > t} e T t ^. 

Then, for an arbitrary t, 

Ef[T p -t\T p >t] < Ef[T Sjt -l\T p >t] 
® Ef pT. lt ]-l 
{ = } Ef[T Sjl ]-l. 

Here, (a) is due to (I3TI) . (b) is due to the fact that T s <t is independent of Tt-x, and (c) is true 
because {Z k ys are conditionally i.i.d. under /i*. 
From Appendix [Ql we have 

Ef[r. tl ]~i^l 

Combining this with Lemma |4~T1 we have 

^ ^ ±lrr , ^ ± , 1 |log7| 



supEj 1 [T p - t\T p > t] 



t>i 



p I 
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Appendix F 
Proof of Lemma I5T1 

We first have the following supporting proposition. 

Proposition El: W\j±*\ exists, and < E"[/i*] < 1. 

Proof: We show that E k is a regular Markov chain with a finite number of states. It is easy 
to see that E k have only C + l possible states. If at the end of the previous time slot, the battery 
has zero energy left, then the transition probability is given as 

P v {E k+1 = 0\E k = 0)=p +p 1 , 

P u (E k+1 = j - l\E k = 0) = Pj , for 1< j < C 

oo 

P"(E k+1 = C\E k = 0) = Pi- 

j=C+l 

If at the end of the previous time slot, the sensor has i(l < % < C) units of energy left, the 
transition probability is given as 

P u (E k+1 = i - l\E k = i) = po, 

P v {E k+l = i + j - l\E k = i) = Pj , for 1 < j < C - i 

oo 

P v {E k+1 = C\E k = i) = Pr 

j=C-i+l 

The above transition probability indicates E k is a regular Markov chain. We denote the stationary 
distribution as w = [w , wi, . . . , wc] T , where Wi is the stationary probability for the state E k = i, 
then we have 

= 1-P"\fi, = 0] 

= 1 - P v [v k = 0] P v [Em = 0] 

= 1 — poWo as k — > oo 

exists, and < E v \fi%] < I. ■ 

We denote p = W\pL*\. The rest of the proof follows the one in Appendix O by replacing p 
with p. 
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Appendix G 
Proof of Lemma I5T21 

We first prove the asymptotic optimality of (/2*,T P ) for problem (P2). The proof relies on 
some supporting propositions and Theorem 4 of [fl5l . 

Proposition G.l: For the power allocation scheme u*, we have 



lim sup esssup P/* 



k+m 

- KZi) <pi-s 

i=k 



Zi, 



Z k -i > -> V<5 > 0. 



(32) 



fc>t>i 

Proof: As we have shown in Proposition IC.ll for any realization of Zi, . . . , Zk-i, and 
VA; > t, under the power allocation scheme jl*, we have 

k+m— 1 



— V /(Z^ ^4' pT, m ->■ 
m z — ' 



oo. 



Then 

lim esssup 

m—Hx> 

for all k > t. Therefore 



i=k 



k+m 



m z — ' 

i=k 



> 5 



Zi, . . . , Z 



k-l 



V5 > 0, 



k+m 

— 7 l(Zi) < pi — 5 

171 Z — ' 

i=k 



Z\i ■ ■ ■ , Z 



k-l 



^0 



because the above the expression holds for every k >t. Then the proposition follows. 

Proposition G.2: Under the power allocation scheme jl*, Page's stopping time T p satisfies 



where 



but 



sup P£ (k < T p < k + m a ) < a, 

k>l 



lim inf — — ^— ■ > (pi) 1 , 
I log a 



(33) 



logm Q = o(loga) as a 0. 
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Proof: For any k, 



P£(k<T p <k + m a ) 

k+m a —l 

E p £( T P = ~ k ) 

k=k 

k+m a — 1 k 

^ e n ^)>5,3o<j<fc-i 

fc=fc ^ i=k—j 

k+m a — 1 f fc' 

E p -< J] ^)>5,30</<A/-l 

fc=fc Ki'=k'-j' 



(a) 



(6) 



&+m a — 1 f A; 



e p ~ n l ^ - s ' 30 - ^ - k> 

k=k Ki'=l 



^ k+m a ~l 

< E exp(-log J B) 

fc=fc 

= m Q exp(— log-B). (34) 

Here, (a) is true because the likelihood ratio of {Zi} and that of j^ij are the same. Then we 
substitute {Zi} with |^i}> an d change the probability measure correspondingly, i', k! and j' are 
the new indices in {^} corresponding to the original i, k and j in {Zi}. (b) holds because under 
Poo, are i-i-d., then we reverse the sequence, (c) is due to Doob's martingale inequality, 

since under P^, is a martingale with expectation 1. 

By (|34|) . we can simply choose m a = |loga|(p/) _1 + 5, and choose B, the threshold of 
CUSUM, such that m a exp(- log 5) = a. ■ 
To prove Lemma 15 .21 we need Theorem 4 ii) of [fT51 , which is restated as follows: 
Theorem G.3: ( fdS\l) Let {Z k } be a random variables sequence with a deterministic but 
unknown change point t. Under probability measure P t , the conditional distribution of Z k is 
/oGlZ^ 1 ) for jfe < t and is /i(-|Zj -1 ) for k > t. Denote l(Z k ) as 

' (zo=log ii^p)' 

Denote e c as the threshold used in Page's stopping time. Then 

Eoo[T p ] > e c . 
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Denote E t (T) as Lorden's detection delay, i.e., 

%(T) = supesssupE 4 [(T - t + 1) + \Z U . . . , Z t _J . 



t>i 

If V<5 > 0, the condition 



k+m 

— > < Ji — 8 

m ^ 



holds for some constant 7 ls and as a — >• 0, there exists some m a which dependents only on a 
such that 



where 



but, 



Then, 



sup Poo(k <T p <k + m a ) < a, 

k>l 



lim inf — — — r > L 1 , 
I log a 



logm Q = o(loga) as a — > 0. 



E t (T) < (If 1 + o(l))c as c -> oo. 



Proof: Please refer to [[T5l . ■ 
By Proportion IG.ll and IG.2[ (jl* 7 T p ) is a strategy that satisfies the conditions in Theorem 
|G.3i Hence, if we choose c = log 7 and Ii = pi in the theorem, it is easy to verify that 
d(fl*,T p ) < {{pl)~ l + o(l))\ log 7 1 with Eg(Tp) > 7. Therefore, (/2*,T P ) is asymptotically 
optimal for (P2). 

In the rest of this appendix, we show the asymptotic optimality of (p,*, T p ) for problem (P3). 
Lemma G.4: 



supEf [T p - t\T p > t] ~ lUEill. (35) 



t>i 



p I 



Proof: Follow the similar argument in Appendx El we have 

Ef[T p -t\T p >t] < Ef [T Sjt -l\T p >t] 



Ef [T S;t ] - 1. (36) 
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We claim that 



Ef [T Stt \E t =i}< Ef [T s , t |£ t = 0] , for % = 1, . . . , C, 

that is, at the change point t, if there is energy left in the battery, the average detection delay tends 
to be smaller than that of the case with an empty battery. Since Ef [T s>t \ = Ef Ef [T Sjt \E t 
we have 

Ef* [T p - t\T p >t}< Ef [T a>t \Et = 0] - 1. 



Let B = 7, we have 



inf < m > 1 



t+m 



^/(^)>log7 



i=t 



We define a sequence of stopping times {T s ^ , . . . , T s ^ , . . .} in the following manner: 
1) Set E t = 0. Define 



= inf - 



t+m 



^/(Z,)>log 7 



i=t 



2) Set S T („-i) = 0. Define 



= inf { m > 1 



n(n-l) 



+m 



=T ( r D +1 



That is, at change point t, we discard all the energy left in the battery and then start a new SPRT 
under the power allocation jl*. When the previous SPRT stops, we empty the battery again, and 
start a new SPRT immediately. Then, this sequence of stopping time {T^ , . . . , T s ™ , . . . , } are 
independent with the same distribution of T Sjt under E t = 0. Therefore, by the strong LLN, for 
an N that large enough, we have 



M T t {1) + T t {2) 



N N 

rpii) 



_-^n-Ef [T. it \Et = 0], 



where M = Y*=i T .t- Since 

we have 

t+M 

Y,KZi)>N log 7 , 



as 7 — > oo, M — >■ oo, then 



t+M 



— V /(^) > — lo 

i=t 
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that is 




> 



log 7 
pi 



If we ignore the overshoot, we will have 



Ef [T a , t \E t = 0] 



log 7 
pi 



Then, we have 



Ef* [T p - t\T p >t}< 



log 7 
pi 



(1+0(1)). 
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